Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
نویسندگان
چکیده
Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8.
منابع مشابه
Stress-Strength and Ageing Intensity Analysis via a New Bivariate Negative Gompertz-Makeham Model
In Demography and modelling mortality (or failure) data the univariate Makeham-Gompertz is well-known for its extension of exponential distribution. Here, a bivariate class of Gompertz--Makeham distribution is constructed based on random number of extremal events. Some reliability properties such as ageing intensity, stress-strength based on competing risks are given. Also dependence properties...
متن کاملCENTDIST: discovery of co-associated factors by motif distribution
Transcription factors (TFs) do not function alone but work together with other TFs (called co-TFs) in a combinatorial fashion to precisely control the transcription of target genes. Mining co-TFs is thus important to understand the mechanism of transcriptional regulation. Although existing methods can identify co-TFs, their accuracy depends heavily on the chosen background model and other param...
متن کاملIncorporating Motif Analysis into Gene Co-expression Networks Reveals Novel Modular Expression Pattern and New Signaling Pathways
Understanding of gene regulatory networks requires discovery of expression modules within gene co-expression networks and identification of promoter motifs and corresponding transcription factors that regulate their expression. A commonly used method for this purpose is a top-down approach based on clustering the network into a range of densely connected segments, treating these segments as exp...
متن کاملBayesian Joint Prediction of Associated Transcription Factors in Bacillus subtilis
Sigma factors, often in conjunction with other transcription factors, regulate gene expression in prokaryotes at the transcriptional level. Specific transcription factors tend to co-occur with specific sigma factors. To predict new members of the transcription factor regulon, we applied Bayes rule to combine the Bayesian probability of sigma factor prediction calculated from microarray data and...
متن کاملOn prediction of regulatory genes by analysis of C.elegans functional networks
Connectivity networks have recently become widely used in biology due to increasing amounts of information on the physical and functional links between individual proteins. This connectivity data provides valuable material for expanding our knowledge far beyond the experimentally validated via mathematical analysis and theoretical predictions of new functional interactions. In this paper we dem...
متن کامل